2025-05-27 09:10:55.AIbase.18.4k
MiTA AI Search Launches New Ultra-Fast Model: Up to 400 tokens/second response speed
Recently, MiTA AI Search officially launched the new ultra-fast model, bringing users a more efficient and accurate search experience. The MiTA AI Search team successfully achieved a maximum response speed of 400 tokens/second on a single H800 GPU through kernel fusion technology on GPUs and dynamic compilation optimization strategies on CPUs, with most responses provided within 2 seconds.